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Abstract. Motivated by Euler's observation that the polynomial x 2 + x + 41 
takes on prime values for < x < 39, we search for large values of x for which 
N = x 2 + x + 41 is prime. To apply classical primality proving results based 
on the factorization of TV — 1, we choose x to have the form g(y), chosen so 
that g(y) 2 + g(y) + 40 is reducible. Our main result is an explicit, 60,000 digit 
prime number of the form x 2 + x + 41. 



1. Introduction and Statement of Results 

In 1772, Euler wrote to Johann Bernoulli and mentioned his observation that 
f(x) — x 2 + x + 41 takes on prime values for < x < 39. Even after this point, 
f(x) continues to take on a high frequency of prime values. For instance, among 
the numbers /(l), /(2), /(3), . . ., /(10 6 ), 261080 of them are prime. This is more 
than three times the number of primes in the sequence 1, 2, 3, . . ., 10 6 . 

In 1913, Rabinowitsch [8] proved that n 2 + n + A is prime for < n < A — 1 if 
and only if the ring 

1 + iy/D 



Z. 



is a principal ideal domain, where D = AA — 1. It is a very deep result of Baker, 
Heegner, and Stark that if D = 3 (mod 4), then 



1 + iVZ) 



is a principal ideal domain D = 3,7, 11, 19, 43, 67 and 163. 



In [3] , pg. 271-274, Cox gives an account of the proof and an overview of the history 
of this result. 

Euler's polynomial is not unique in taking on a long string of prime values. 
Indeed, the polynomial 36a; 2 — 810a; + 2753 (discovered by R. Ruby, see [5], pg- 112) 
takes on 45 distinct consecutive prime values (in absolute value) for < x < 44. 
However, for polynomials of the form x 2 + x + A, Euler's polynomial still holds the 
record. 

In 1923, Hardy and Littlewood [5] stated a number of precise conjectures about 
the distribution of primes satisfying various additional conditions. Their prime k- 
tuples conjecture implies that for any positive integer m, there is a number A so 
that 

x 2 + x + A is prime for < x < m. 
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In other words, with a large enough choice of A, Euler's polynomial can be beaten. 
In addition, they stated a precise conjecture about how frequent prime values of a 
fixed quadratic polynomial f(x) are. 

Conjecture (See Conjecture F in [9], pg. 190). Let a,b,c € Z with a > 0, 
gcd(a, b, c) = 1, b 2 — 4ac not a square where a+b and c are not both even. Let f(x) — 
ax 2 + bx + c and let nf(x) = #{p < x : p is prime and p = f(n) for some n € Z}. 
Then, 

"/w-^i^T n 



Here 



a log(x) p . 

p\ gcd(a,fc) 



b is odd 
b is even, 




fb^-Aa 



p>2 \ 

denotes the Legendre symbol. 



p-1 



For f(x) — x 2 +x+Al, the conjecture predicts that 717(2;) ~ (6.6395464...)- 



and the large value of the constant C arises because the values of x 2 + x + 41 are 
never multiples of primes p < 41. See the papers [4] and j6] for computations of 
larger values of A for which the corresponding value of the constant C is large. 

The goal of the present paper is to provide some verification of the conjecture of 
Hardy and Littlewood by searching for large prime values of x 2 + x + 41. To state 
our main result, recall that n# is the product of primes less than or equal to n. 

Theorem 1. Let f(x) = x 2 + x + 41 and g(x) = 40x 3 + 41x 2 + 42x +1. If we set 

_ 310927391 ■ 23143# 
X ~ 43 ' 

then f{g{x)) is a 60, 000 digit prime number. 

The fastest current methods for proving primality of large numbers N are based 
on knowing partial prime factorizations of N — 1 or N + 1. The best general 
primality proving method not based on factorizations is the elliptic curve primality 
proving method (ECPP), and the current record for a general number is 26,642 
digits. This number was proven prime by Frangois Morain in 2011. (Note that in 
PQ , primality of numbers in a very particular sequence is proven using ECPP. Some 
of these numbers have more than 100,000 digits, but this method does not apply 
in general.) 

Prime numbers of the form x 2 + 1 have received special attention, and the largest 
known such prime is (as of this writing) 75898 524288 + 1, with 2,558,647 digits. It 
is straightforward to find large primes of this type since for a number of the form 
x 2 + 1 we can factor TV — 1 as long as we know the prime factorization of x. 

This is not the case for f(x) = x 2 +2; + 41. Our approach to finding large primes 
of this type is to find polynomials g(x) so that f(g(x)) — 1 is reducible. A computer 
search revealed the choice g(x) = 40x 3 + 41x 2 + A2x + 1 for which 

f(g(x)) - 1 = (40a; 2 + x + l)(40a; 4 + 81x 3 + I23x 2 + 84a; + 42). 
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The Brillhart-Lehmer-Selfridge theorem (see [2], Theorem 5 or Section 2) allows 
one to prove the primality of N provided we know the complete factorization of a 
factor F of N — 1 with F larger than about N 1 / 3 . Our goal then is to find a choice 
of x (with known prime factorization) for which 40a; 2 + x + 1 is prime (proven again 
using the Brillhart-Lehmer-Selfridge theorem), and for which f(g(x)) is prime. This 
simultaneous primality requirement significantly increases the number of candidate 
values of x we must search, and makes our result comparable in difficulty to finding 
a large twin prime or Sophie Germain prime. 

An outline of the paper is as follows. In the second section, we give appropri- 
ate background, and in the third section we describe our computations and the 
verification of the primality of our 60,000 digit number of the form x 2 + x + 41. 

Acknowledgements. We used PARI/GP [7j for sieving computations, OpenPFGW 
for primality testing, and the Wake Forest DEAC cluster for primality testing com- 
putations. We would like to thank David Chin for compiling OpenPFGW for us on 
the DEAC cluster. 

2. Strategy 

We start by stating the Brillhart-Lchmcr-Sclfridge theorem. 

Theorem 2 (A special case of Theorem 5 of [2]). Suppose that N > 1 is odd and 

write N — 1 = FR where F is even and the prime factorization of F is known. 
Suppose also that 

(1) F > (f )V3, 

(2) For each prime pi dividing F , there is an integer ai so that di 1 = 1 

JV-l 

(mod TV) and gcd(a t *>< — 1, N) = 1, 

(3) // we write R — 2Fq + r, where 1 < r < 2F, then either q — or r 2 — 8q 
is not a perfect square, 

then N is prime. 

In order to take advantage of the Brillhart-Lehmer-Selfridge Theorem for prov- 
ing primality, we used the following equations 

f{g(x)) - 1 = h(x)i(x) 

h(x) = 40x 2 + x + I 

i(x) = 40x 4 + 81a; 3 + 123x 2 + 84a; + 42, 

with f(x) and g(x) defined above. Thus for any given choice of x, with x even, we 
have N = f(g(x)) with 2h{x) = F and ^ = R. 

Next, we estimate how many numbers we will have to test. According to the 
Prime Number Theorem, the density of primes close to an integer N is approxi- 
mately equal to ^nvl • Since we were looking for a 20,000 digit number and the 
corresponding 60,000 digit number to be simultaneously prime, if we assume the 
same density of primes within the values of Euler's Polynomial as within the set 
of all integers, we must multiply the probability of finding a 20,000 digit prime 
with the probability of finding a 60,000 digit prime. Thus our expected probabil- 
ity for a given pair of values (corresponding to h(x) and f(g(x)) from above) is 
approximately 
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( ln(10 2oooo))( ln(10 Boooo)) ~ 6,362,314,060' 

If we were to test 6,362,314,060 numbers, our chance of finding at least one prime 
pair would be 

-, _ / 6,362,314,059 ^6.362,314.060 
v 6,362,314,060 ' 

We know that 1 — ((N — l)/N) N thus 

-, _ i 6,362,314,059 \6, 362, 314, 060 ^ i _ 1 ^ fi<3 oCK 
1 v 6,362,314,060 J ~ 1 e ~uo.^/o. 

We chose to broaden our search in order to have a higher theoretical probability of 
success. We wanted to be roughly 95% confident that our search will yield success, 
thus we tripled our amount of numbers to check. 

1 _ / 6,362,314,059 \3-6. 362,314,060 ^ 1 _ (l\3 ^ nc nW 
1 V 6,362,314,060 J ~ 1 \e> ~ aJ - u/0 - 

Since the equation we were working with was 

f{g{x)) = 1600a; 6 + 3280a; 5 + 5041a; 4 + 3564a; 3 + 1887a; 2 + 126a; + 43, 
we chose to use a primorial divided by 43 as our x. By working with an integer 
multiple of a primorial as our value for x, x — k ■ we know that f(g(x)) will 
not be divisible by any prime less than n. This is because we know that if x = 
(mod p), then h(x) = 1 (mod p) and f(g(x)) = 43 (modp). Since f(g(x)) = 43 
(mod p) if x = (mod p), we do not want s to be a multiple of 43. 

For each potential prime divisor eliminated, the number of potential primes 
decreases by | where p is the divisor eliminated. This is due to the fact that the 
density of numbers n divisible by p is - . Thus the number of numbers we should 
check should be 

dip p rim c<23,i4 3 P ~f-f (3 • 6, 362, 314, 060) « 59, 481, 223, 

due to our use of 23 ^ 3# as a factor of x. 

Next, we chose to further reduce our search by sieving the test numbers. We 
eliminated all numbers that were divisible by primes under 5 • 10 10 . The number of 
numbers left after sieving up to 5 • 10 10 is approximately 



3- (6,362,314,060)- J| ^1 - i 



p prime 

p^s-io 1 



Mortens 's theorem states that 



n 

p prime 

p<x 



i-I 



p ) e 1 ln(x) 



Using this approximation, we estimate that 9,914,204 numbers would remain after 
sieving up to 5 • 10 10 . 

Finally, we estimate how much CPU time we will need to use. On our computers, 
primality tests took approximately 14 seconds for 20,000 digit numbers, and 123 
seconds for 60,000 digit numbers. This left us with the following preliminary CPU 
time estimations 
Without utilizing primorials: 

3 • 6, 362, 314, 060 • 14 seconds + 3 ln(10 60000 ) • 123 seconds « 8, 475 years. 
Using Primorials Pre-Sieve: 

59, 481, 223 • 14 seconds + 59, 481, 223/ ln(10 20000 ) • 123 seconds w 26 years. 
After Sieving: 
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9, 914, 204 • 14 seconds + 9, 914, 204/ ln(10 20000 ) • 123 seconds w 4 years. 

3. Computations 

We begin by describing how the sieving was done. In f(g(x)) we had x = 
fc( 23 '*f # ), thus the sieving was designed to eliminate values of k for which f(g(x)) 
was a multiple of a prime p. To do this, we looked for roots of the polynomials, 
f(x) and f(g(x)), in F p . We implemented the sieving through a program we wrote 
to run through Pari/GP. For each prime, p, in our testing range (below 5 • 10 10 ) 
we took the following steps. First, we factored 40a; 2 + x + 1 mod p and computed 
its roots in ¥ p . Then, we eliminated the choices of k for which the corresponding x 
value is a root. Once we had completed this process, we then repeated this process 
on 1600a; 6 + 3280a; 5 + 5041a; 4 + 3564a; 3 + 1887a; 2 + 126a; + 43, once again eliminating 
roots in ¥ p . 

Next, we examine our sieving results. We sieved up to 5 • 10 10 and brought 
our total number of numbers down from 60,000,000 to 9,946,272. This is just over 
30,000 more than we had estimated would be left after sieving. This process took 
approximately five days running on a single computer. 

Next, we describe how the pseudo-primality testing was done. We ran Format 
pseudo primality tests to find probable primes base 3 on OpenPFGW. OpenPFGW 
uses the fast Fourier transform (FFT) method for fast multiplication. We ran 
our tests in groups of around 900 numbers on the DEAC cluster, which has ap- 
proximately 1,200 nodes with processors ranging from 2.4 GHz to 3.0 GHz. Wc 
automated job submission to the cluster by checking how many jobs were cur- 
rently running and how many nodes were in use and making appropriate choices 
for submitting more jobs based on that information. When a pseudo-prime was 
found, that number was appended to a file, giving us a single list of all 20,000 digit 
pseudo-primes found. While the tests were running, we periodically checked the 
corresponding 60,000 digit numbers for pseudo-primality base 3. 

Finally, we examine our computation results. We tested a bit more than 3,000 
of the resulting 60,000 digit numbers and found one pseudo-prime. We stopped 
running tests once we successfully confirmed through OpenPFGW that the 60,000 
digit pseudo-prime was prime. We had found 3,521 20,000 digit pseudo-primes, 
and the 2,813th one corresponded to our 60,000 digit prime. Of the 9,946,271 
numbers post-sieving, the 20,000 digit number yielding the 60,000 digit prime was 
the 3,106,282nd. The total amount of CPU time that was used was: 5 days for 
sieving, 14 • (3, 300, 000) seconds for 20,000 digit pseudo-primality tests, and 123 • 
(3, 000) seconds for 60,000 digit pseudo-primality tests, totalling approximately 544 
days. 
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